Case(Control)-Free Multi-SNP Combinations in Case-Control Studies
نویسندگان
چکیده
Several genome-wide searches for disease-associated gene variations have been recently reported (Spinola et al, 2006; Herbert et al, 2006). However, complex diseases can be caused by combinations of several unlinked gene variations. Unfortunately, an exhaustive search among all possible corresponding multi-SNPs can be unfeasible even for small number of SNPs let alone the complete genome. Disease association analysis searches for a SNP with frequency among diseased individuals (cases) considerably higher than among non-diseased individuals (controls). In this work we first explore the problem of searching for the most disease-associated and the most disease-resistant multi-gene interactions for a given case-control study. In the previously published work [3] we proposed fast complimentary greedy search (CGS) which finds multi-SNP combinations with non-trivially high association on real data. Here we present two new approaches which are slightly slower than CGS but aimed to find more MSCs associated with a disease. One is k-level CGS (k-CGS) which is a modification of CGS with fixed k SNPs. Second one is k-level Alternating Closure Search (k-ACS) which starts with a set defined by k-length MSC found by the exhaustive search and minimizes its p-value by reducing the subset of controls and keeping the subset of cases as large as possible. We compare proposed methods with CGS on case/control datasets of Crohn’s disease (Daly et al, 2001), autoimmune disorder (Ueda et al, 2003), tick-borne encephalitis (Barkash et al, 2006), and rheumatoid arthritis (GAW15 NARAC 18q, 2006). When dealing with common diseases, it is necessary to search and analyze multiple independent causes each resulted from interaction of multiple SNPs scattered over the entire genome. Exploiting k-CGS and k-ACS methods for searching associated risk and resistance factors, we address the disease susceptibility prediction problem. We use greedy version of proposed in [3] optimum clustering formulation to extract independent causes. Then we use model-fitting algorithm that transforms clustering algorithm into susceptibility prediction algorithm. We compare the accuracy of the proposed prediction methods (k-CGSP and k-ACSP) with previously known methods using leave-one-out (LOO) test.
منابع مشابه
P-244: Analysis of Genomic and Cell Free DNA of A let-7 microRNA Binding Site of KRAS Gene Polymorphisms in Endometriosis
Background: Endometriosis is one of the most common benign gynecological diseases which is characterized by endometriallike tissue growing outside the uterine cavity. Although the pathology of endometriosis remains unknown, the genetic predisposition plays an apparent role. Several genes have been contributed to endometriosis, but it seems KRAS has a crucial role, because its activation results...
متن کاملCompare the Relative Frequency of Thyroid Function Disorders in Obese and Overweight Children with Non-Obese Children; a Case-Control Study
BackgroundChildhood obesity has become a global epidemic. In the recent studies has reported risk of thyroid dysfunction due to obesity. Therefore, this study intends to compare the relative frequency of thyroid function disorders in obese and overweight children with non-obese children.Materials and MethodsThis case-control study during 2015 to 2016 in Birjand, Iran was done on 137 children an...
متن کاملGenetic polymorphisms in the estrogen receptor - α Gene codon 325(CCC}CCG) and risk of breast cancer among Iranian women: a case control study
Abstract Background: The Iranian breast cancer patients are relatively younger than their Western counterparts. Evidence suggests that alterations in estrogen signaling pathways , including estrogen receptor-α (ER- α ), occur during breast cancer development in Caucasians. Epidemiologic studies have revealed that age-incidence patterns of breast cancer in Asians differ from those in Cauca...
متن کاملA Knowledge-Based Method for Association Studies on Complex Diseases
Complex disorders are a class of diseases whose phenotypic variance is caused by the interplay of multiple genetic and environmental factors. Analyzing the complexity underlying the genetic architecture of such traits may help develop more efficient diagnostic tests and therapeutic protocols. Despite the continuous advances in revealing the genetic basis of many of complex diseases using genome...
متن کاملConditional likelihood methods for haplotype-based association analysis using matched case-control data.
Genetic epidemiologists routinely assess disease susceptibility in relation to haplotypes, that is, combinations of alleles on a single chromosome. We study statistical methods for inferring haplotype-related disease risk using single nucleotide polymorphism (SNP) genotype data from matched case-control studies, where controls are individually matched to cases on some selected factors. Assuming...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006